Faithful Variable Screening for High-dimensional Convex Regression By
نویسندگان
چکیده
We study the problem of variable selection in convex nonparametric regression. Under the assumption that the true regression function is convex and sparse, we develop a screening procedure to select a subset of variables that contains the relevant variables. Our approach is a two-stage quadratic programming method that estimates a sum of one-dimensional convex functions, followed by one-dimensional concave regression fits on the residuals. In contrast to previous methods for sparse additive models, the optimization is finite dimensional and requires no tuning parameters for smoothness. Under appropriate assumptions, we prove that the procedure is faithful in the population setting, yielding no false negatives. We give a finite sample statistical analysis, and introduce algorithms for efficiently carrying out the required quadratic programs. The approach leads to computational and statistical advantages over fitting a full model, and provides an effective, practical approach to variable screening in convex regression.
منابع مشابه
Marginal Regression For Multitask Learning
Variable selection is an important and practical problem that arises in analysis of many high-dimensional datasets. Convex optimization procedures that arise from relaxing the NP-hard subset selection procedure, e.g., the Lasso or Dantzig selector, have become the focus of intense theoretical investigations. Although many efficient algorithms exist that solve these problems, finding a solution ...
متن کاملNootropic Medicinal Plants; Evaluating Potent Formulation By Novelestic High throughput Pharmacological Screening (HTPS) Method
The principle of this method was to screen the pharmacological activity of six prepared polyphyto formulations by using high throughput screening method for their nootropic action. The study was performed in three stages using one, two and three animals, respectively in a group. Test formulations were given p.o daily at the dose of 50 and 100 mg/kg body weight. The test formulations were compar...
متن کاملTwo tales of variable selection for high dimensional regression: Screening and model building
Variable selection plays an important role in high dimensional regression problems where a large number of variables are given as potential predictors of a response of interest. Typically, it arises at two stages of statistical modeling, namely screening and formal model building, with different goals. Screening aims at filtering out irrelevant variables prior to model building where a formal d...
متن کاملShape-Constrained Estimation in High Dimensions
Shape-constrained estimation techniques such as convex regression or log-concave density estimation offer attractive alternatives to traditional nonparametric methods. Shape-constrained estimation often has an easyto-optimize likelihood, no tuning parameter, and an adaptivity property where the sample complexity adapts to the complexity of the underlying functions. In this dissertation, we posi...
متن کاملMLSLR: Multilabel Learning via Sparse Logistic Regression
Multilabel learning, an emerging topic in machine learning, has received increasing attention in recent years. However, how to effectively tackle high-dimensional multilabel data, which are ubiquitous in real-world applications, is still an open issue in multilabel learning. Although many efforts have been made in variable selection for traditional data, little work concerns variable selection ...
متن کامل